Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP

نویسندگان

  • Jiro Kiyama
  • Yoshiaki Itoh
  • Ryuichi Oka
چکیده

We propose a new approach for detecting topic boundaries and keywords in arbitrary speech, with neither recognition nor prosodic processing, aiming at quick access to the content of recorded raw speech. This approach is based on the general tendency that frequently-repeated phrases/words in speech are characteristic of topics in discourse, so it uses pairs of phonetically similar segments (PPSSs) of speech to represent topics in speech. This approach has the advantage of being domain and language-independent and robust against variations in the speaker and background noise, as it needs neither a language nor acoustic model in advance. Experiments using simulated dialogues con rmed the good performance of this approach. We also propose Incremental Reference Interval-free Continuous Dynamic Programming (IRIFCDP) as an algorithm for detecting PPSSs in speech for the above method. IRIFCDP can detect PPSSs e ciently in synchronization with the speech, so it is suitable for handling long speech samples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic prosodic segmentation by F0 clustering using superpositional modeling

In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech by using F0 information. In the training phase, hand labeled accent patterns are parameterized according to a superpositional model proposed by Fujisaki, and assigned to some clusters by a clustering method, in which accent templates are calculated as centroid of each cluster. In t...

متن کامل

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

Semi-Automatic Segmentation System for Syllables Extraction from Continuous Arabic Audio Signal

The paper describes a speaker independent segmentation system for breaking Arabic uttered sentences into its constituent syllables. The goal is to construct a database of acoustical Arabic syllables as a step towards a syllable-based Arabic speech verification/recognition system. The proposed technique segments the utterances based on maxima extraction from delta function of 1st MFC coefficient...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996